Vehicle-based media system with audio advertisement and external-device action synchronization feature

ABSTRACT

In one aspect, an example method to be performed by a vehicle-based media system includes (a) receiving audio content; (b) causing one or more speakers to output the received audio content; (c) using a microphone of the vehicle-based media system to capture the output audio content; (d) identifying reference audio content that has at least a threshold extent of similarity with the captured audio content; (e) identifying a computational action based at least on the identified reference audio content; and (f) sending, via a network interface of the vehicle-based media system, an instruction that causes an external computing device to perform the identified computational action.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. Provisional Patent Application No. 62/714,619, filed on Aug. 3, 2018, U.S. Provisional Patent Application No. 62/714,621, filed on Aug. 3, 2018, and U.S. Provisional Patent Application No. 62/714,623, filed on Aug. 3, 2018, each of which is hereby incorporated by reference in its entirety. Additionally, this application hereby incorporates by reference the following U.S. Patent Applications, each filed on the same date as the present application: U.S. patent application Ser. No. 16/516,029, titled “Vehicle-Based Media System with Audio Ad and Visual Content Synchronization Feature” and U.S. patent application Ser. No. 16/516,035, titled “Vehicle-Based Media System with Audio Ad and Navigation-Related Action Synchronization Feature”.

USAGE AND TERMINOLOGY

In this disclosure, unless otherwise specified and/or unless the particular context clearly dictates otherwise, the terms “a” or “an” mean at least one, and the term “the” means the at least one.

In this disclosure, the term “connection mechanism” means a mechanism that facilitates communication between two or more components, devices, systems, or other entities. A connection mechanism can be a relatively simple mechanism, such as a cable or system bus, or a relatively complex mechanism, such as a packet-based communication network (e.g., the Internet). In some instances, a connection mechanism can include a non-tangible medium (e.g., in the case where the connection is wireless).

BACKGROUND

Vehicle-based media systems can receive media content to be presented to an occupant of a vehicle. For example, a vehicle-based media system can include an antenna configured to receive a radio signal, such as a satellite radio signal or a terrestrial radio signal, that is carrying audio content, and the vehicle-based media system can output the audio content through one or more speakers located within the vehicle.

SUMMARY

In one aspect, an example vehicle-based media system is disclosed. The example vehicle-based media system comprises (a) a user interface, (b) a microphone, (c) one or more processors, and (d) a non-transitory, computer-readable medium storing instructions that, when executed by the one or more processors, cause the vehicle-based media system to perform a set of acts comprising: (a) receiving audio content; (b) causing one or more speakers to output the received audio content; (c) using the microphone of the vehicle-based media system to capture the output audio content; (d) identifying reference audio content that has at least a threshold extent of similarity with the captured audio content; (e) identifying a computational action based at least on the identified reference audio content; and (f) sending, via a network interface of the vehicle-based media system, an instruction that causes an external computing device to perform the identified computational action.

In another aspect, an example method to be performed by a vehicle-based media system is disclosed. The method includes (a) receiving audio content; (b) causing one or more speakers to output the received audio content; (c) using a microphone of the vehicle-based media system to capture the output audio content; (d) identifying reference audio content that has at least a threshold extent of similarity with the captured audio content; (e) identifying a computational action based at least on the identified reference audio content; and (f) sending, via a network interface of the vehicle-based media system, an instruction that causes an external computing device to perform the identified computational action.

In another aspect, an example non-transitory computer-readable medium is disclosed. The computer-readable medium has stored thereon program instructions that, upon execution by a vehicle-based media system, cause the vehicle-based media system to perform a set of operations comprising (a) receiving audio content; (b) causing one or more speakers to output the received audio content; (c) using a microphone of the vehicle-based media system to capture the output audio content; (d) identifying reference audio content that has at least a threshold extent of similarity with the captured audio content; (e) identifying a computational action based at least on the identified reference audio content; and (f) sending, via a network interface of the vehicle-based media system, an instruction that causes an external computing device to perform the identified computational action.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a simplified block diagram of an example computing device.

FIG. 2A is an example graphical user interface (“GUI”) in a first state.

FIG. 2B is the example GUI of FIG. 2A, but in a second state.

FIG. 2C is the example GUI of FIG. 2A, but in a third state.

FIG. 3 is a flow chart of an example method.

DETAILED DESCRIPTION

I. Overview

A vehicle-based media system receives audio content over a radio broadcast and presents the received audio content to an occupant of a vehicle, for instance, by playing the audio content through one or more speakers of the vehicle. The audio content received and played back by the vehicle-based media system can include various types of audio content. For instance, the audio content can include primary content, such as music, radio programs, or the like, and the audio content can include secondary content, such as advertisements, interspersed among the primary content.

Some vehicle-based media systems include a visual display for displaying visual information to an occupant of a vehicle. Such a vehicle-based media system can be configured to display information related to audio content being played back by the media system. For instance, while the media system is playing audio content, the media system can cause the visual display to display information identifying the audio content, such as a title and/or artist of a song, or identifying a source of the audio content, such as a radio station that is broadcasting the audio content. Such information to be displayed by the vehicle-based media system can be encoded in the radio broadcast that is carrying the audio content, and the vehicle-based media system can extract this information from the radio broadcast.

In practice, when the vehicle-based media system receives primary content from the radio broadcast, such as a song or a radio program, the vehicle-based media system can obtain, from the radio broadcast, and display, via its visual display, information identifying the song or radio program and/or a source of the song or radio program. For instance, for a terrestrial radio broadcast, such as a frequency modulation (FM) radio broadcast, information can be encoded into the radio signal according to the Radio Broadcast Data System (RBDS) communication protocol. Other examples are possible as well.

In some scenarios, however, the vehicle-based media system might not receive from the radio broadcast any visual information to be displayed, or the occupant of the vehicle may wish to receive information in addition to, or instead of, that which the vehicle-based media system receives from the radio broadcast. For instance, when the vehicle-based media system receives secondary content, such as an advertisement, from the radio broadcast, the radio signal may not be encoded with visual information for display by the vehicle-based media system, or the radio signal may merely be encoded with visual information identifying the radio station broadcasting the advertisement or with various other visual information unrelated to the content of the advertisement. However, an occupant of the vehicle may wish to obtain further information about the advertisement, and/or a business associated with the advertisement may wish to provide further information to the occupant of the vehicle.

Disclosed herein are methods and systems to address this and potentially other issues. In an example method, while a vehicle-based media system is presenting an audio advertisement, the vehicle-based media system determines an audio fingerprint of the advertisement. For example, the vehicle-based media system can receive audio content. This received audio content can include different types of audio content, including music by various artists or other radio programs (collectively, “entertainment content”) and/or audio advertisements (collectively, “advertisement content”).

Once this audio content is received, the vehicle-based media system may also cause one or more speakers in (or around) the vehicle to output the received audio content, and then, using a microphone of the system, capture the outputted audio content. In some examples, the vehicle-based media system can also selectively capture the audio content based on various attributes of the received audio content (e.g., capturing the advertisement content).

The vehicle-based media system may also identify reference audio content that has at least a threshold extent of similarity with the captured audio content and can do so in various ways. For instance, the reference audio content can include a plurality of reference audio fingerprint items, and the vehicle-based media system can generate audio fingerprint data associated with the captured audio content, compare the generated audio fingerprint data to the plurality of reference audio fingerprint items, and determine that a particular reference audio fingerprint item of the plurality of reference audio fingerprint items has at least a threshold extent of similarity with the generated audio fingerprint data. In another aspect, the vehicle-based media system may do some or all of this analysis locally or using other devices and/or processes (e.g., the vehicle-based media system can send the captured audio content (or a portion thereof) to a server, and the server can generate audio fingerprint data associated with the captured audio content and/or perform the comparative analysis). And, in yet another aspect, the vehicle-based media system can also identify reference audio content based on specific audio content with specific attributes (e.g., capturing advertisement content contained in the received audio content and using the captured advertisement content to identify reference audio content).
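The threshold comparison described above can be illustrated with a minimal sketch. It assumes, purely for illustration, that each audio fingerprint is a fixed-length sequence of 32-bit integers and that similarity is the fraction of matching bits; the disclosure does not prescribe any particular fingerprint format or similarity measure, and the names `similarity`, `find_reference`, and the 0.9 threshold are hypothetical.

```python
from typing import Mapping, Optional, Sequence


def similarity(a: Sequence[int], b: Sequence[int], bits: int = 32) -> float:
    """Return the fraction of matching bits between two fingerprints.

    Assumption: fingerprints are equal-length sequences of `bits`-wide integers.
    Mismatched or empty inputs are treated as having no similarity.
    """
    if len(a) != len(b) or not a:
        return 0.0
    mask = (1 << bits) - 1
    differing = sum(bin((x ^ y) & mask).count("1") for x, y in zip(a, b))
    return 1.0 - differing / (bits * len(a))


def find_reference(
    query: Sequence[int],
    references: Mapping[str, Sequence[int]],
    threshold: float = 0.9,  # hypothetical "threshold extent of similarity"
) -> Optional[str]:
    """Return the id of the best reference item meeting the threshold, if any."""
    best_id, best_score = None, threshold
    for ref_id, ref_fp in references.items():
        score = similarity(query, ref_fp)
        if score >= best_score:
            best_id, best_score = ref_id, score
    return best_id


# Hypothetical reference items keyed by advertisement id.
REFERENCES = {
    "ad_restaurant": [0xFFFF0000, 0x12345678],
    "ad_car": [0x00000000, 0x00000000],
}
```

In this sketch, a query that differs from the "ad_restaurant" item by a single bit is matched to it, while a query that is far from every reference item yields no match. The same comparison could run on the vehicle-based media system or on a server.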

In any case, the vehicle-based media system may also use the identified reference audio content to identify a computational action associated with the advertisement and cause an external computing device (separate from the vehicle-based media system) to perform one or more identified computational actions, all while the vehicle-based media system presents the audio advertisement. In this way, once the advertisement that is being received by the vehicle-based media system has been identified, the identified advertisement can be used as a basis to identify a computational action.

In some examples, the computational action could include displaying particular visual content, such that identifying the computational action involves identifying particular visual content to be displayed based on the identified advertisement. For example, to enhance the occupant's experience, the vehicle-based media system can identify the advertisement and can use the identified advertisement as a basis to identify an action. In a further aspect, the vehicle-based media system can cause an external computing device, such as the occupant's mobile phone, to perform the identified action, while the vehicle-based media system presents at least a portion of the advertisement. In one example, the vehicle-based media system can receive and present a terrestrial radio advertisement for a restaurant and while doing so, the vehicle-based media system can cause the occupant's mobile phone to display an image of a food item available at the restaurant together with a scannable identifier, such as a QR code, that links to a corresponding coupon that the occupant can use at a participating location of the advertised business. To retrieve this content, in some examples, one or more servers could receive or be provisioned with respective visual content associated with each respective known advertisement, and the vehicle-based media system could retrieve, from the one or more servers, the respective visual content associated with the identified advertisement. And the vehicle-based media system could then cause the external computing device to display the retrieved visual content.

Further, this visual content can take various forms. As noted above, for instance, the visual content could include a scannable identifier (e.g., a matrix barcode, such as a Quick Response (QR) code, and/or an identifier used with other image recognition technologies) that, when scanned, causes a computing system to access additional content associated with the advertisement. In one example, the scannable identifier can include a QR code or a bar code that is scannable by a point-of-sale device to apply a discount to a purchase, such that the occupant can present the scannable identifier at a business associated with the advertisement in order to obtain the discount. In another example, the visual content can include a logo, product information, or a website of the business associated with the advertisement. This visual content may contain additional content that can increase opportunities for (and the likelihood of) an occupant's interactions with the advertisement content. Other examples are possible as well.

Before causing the external computing device to perform the identified action (e.g., displaying the retrieved visual content), however, the vehicle-based media system can first identify the external computing device as being eligible to perform the action. For instance, the vehicle-based media system can identify the external computing device based on the external computing device being connected to the vehicle-based media system via a connection mechanism (e.g., via a wired connection, such as a USB, Mini-USB, or Lightning connection, or via a wireless connection, such as a Bluetooth or WI-FI connection).

Responsive to determining that the external computing device is connected to the vehicle-based media system, the vehicle-based media system can determine that the external computing device is eligible to perform the action, and the vehicle-based media system can cause the external computing device to perform the action by sending an instruction to the computing device via the connection mechanism. In the above example in which the action involves displaying visual content, the vehicle-based media system can send the visual content and an instruction to display the visual content via the connection mechanism to the external computing device. Responsive to receiving the visual content and the instruction, the external computing device can display the visual content.

Further, in some scenarios, multiple external computing devices may be connected to the vehicle-based media system. Under these circumstances, the vehicle-based media system can cause one of the external computing devices to perform the identified action, or the vehicle-based media system can cause multiple ones or even all of the connected external computing devices to perform the identified action.
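As a minimal sketch of the eligibility check and instruction dispatch described above, the following assumes a hypothetical `ExternalDevice` record with a `connected` flag and represents each instruction as a plain dictionary. The field names and the message shape are illustrative assumptions only; the disclosure does not define a message format.

```python
from dataclasses import dataclass
from typing import Dict, List, Sequence


@dataclass
class ExternalDevice:
    device_id: str   # hypothetical identifier for the device
    connected: bool  # True if attached via a wired or wireless connection mechanism


def eligible_devices(devices: Sequence[ExternalDevice]) -> List[ExternalDevice]:
    """Treat a device as eligible when it is connected to the media system."""
    return [d for d in devices if d.connected]


def build_instructions(
    devices: Sequence[ExternalDevice],
    action: Dict[str, str],
    all_devices: bool = True,
) -> List[dict]:
    """Build one instruction per target device.

    With all_devices=False, only the first eligible device is targeted.
    """
    targets = eligible_devices(devices)
    if not all_devices:
        targets = targets[:1]
    return [{"target": d.device_id, "action": action} for d in targets]


devices = [
    ExternalDevice("phone-a", True),
    ExternalDevice("tablet-b", False),
    ExternalDevice("phone-c", True),
]
display_coupon = {"type": "display", "content": "coupon-qr"}  # hypothetical action
```

Calling `build_instructions(devices, display_coupon)` targets both connected phones, while passing `all_devices=False` targets only the first connected device.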

As a related feature, the vehicle-based media system can also determine location data associated with the vehicle. The location data can include a current location of the vehicle, a destined location of the vehicle, or some location on a route between the current and destined locations of the vehicle, as obtained from a navigation system of the vehicle. The vehicle-based media system can use both the identified advertisement and the determined location data as a basis to retrieve the visual content. For instance, the vehicle-based media system can receive and present a terrestrial radio advertisement for a restaurant chain and while doing so, the media system can display a location of a local one of the restaurants.

In still other examples, the vehicle-based media system can use the determined location data to determine whether to identify an action and cause the external device to perform the identified action. To facilitate this, the server can be further provisioned with respective location data associated with each respective advertisement. For instance, the location data for a particular advertisement can include an address or GPS coordinates of a business associated with the advertisement. In such a scenario, the vehicle-based media system can be configured to only identify an action for advertisements that are associated with a location within a threshold distance (e.g., within a five mile radius) of the location identified by the vehicle location data. Accordingly, in some examples, even if a fingerprint match is detected when comparing the query audio fingerprint data with the reference audio fingerprint data of the known advertisements, the vehicle-based media system can refrain from identifying an action associated with the matched reference fingerprint data if the matched reference fingerprint data is also associated with a location that is at least a threshold distance away from the location identified by the vehicle location data.

II. Example Architecture

A. Computing Device

FIG. 1 is a simplified block diagram of an example computing device 100. Computing device 100 can perform various acts and/or functions, such as those described in this disclosure. Computing device 100 can include various components, such as processor 102, data storage unit 104, communication interface 106, and/or user interface 108. These components can be connected to each other (or to another device, system, or other entity) via connection mechanism 110.

Processor 102 can include a general-purpose processor (e.g., a microprocessor) and/or a special-purpose processor (e.g., a digital signal processor (DSP)).

Data storage unit 104 can include one or more volatile, non-volatile, removable, and/or non-removable storage components, such as magnetic, optical, or flash storage, and/or can be integrated in whole or in part with processor 102. Further, data storage unit 104 can take the form of a non-transitory computer-readable storage medium, having stored thereon program instructions (e.g., compiled or non-compiled program logic and/or machine code) that, when executed by processor 102, cause computing device 100 to perform one or more acts and/or functions, such as those described in this disclosure. As such, computing device 100 can be configured to perform one or more acts and/or functions, such as those described in this disclosure. Such program instructions can define and/or be part of a discrete software application. In some instances, computing device 100 can execute program instructions in response to receiving an input, such as from communication interface 106 and/or user interface 108. Data storage unit 104 can also store other types of data, such as those types described in this disclosure.

Communication interface 106 can allow computing device 100 to connect to and/or communicate with another entity according to one or more protocols. In one example, communication interface 106 can be a wired interface, such as an Ethernet interface or a high-definition serial-digital-interface (HD-SDI). In another example, communication interface 106 can be a wireless interface, such as a radio, cellular, or WI-FI interface. In this disclosure, a connection can be a direct connection or an indirect connection, the latter being a connection that passes through and/or traverses one or more entities, such as a router, switcher, or other network device. Likewise, in this disclosure, a transmission can be a direct transmission or an indirect transmission.

User interface 108 can facilitate interaction between computing device 100 and a user of computing device 100, if applicable. As such, user interface 108 can include input components such as a keyboard, a keypad, a mouse, a touch-sensitive panel, a microphone, and/or a camera, and/or output components such as a display device (which, for example, can be combined with a touch-sensitive panel), a sound speaker, and/or a haptic feedback system. More generally, user interface 108 can include hardware and/or software components that facilitate interaction between computing device 100 and the user of the computing device 100.

In this disclosure, the term “computing system” means a system that includes at least one computing device, such as computing device 100. A computing system and/or components thereof can perform various acts, such as those set forth below.

In this disclosure, the term “vehicle-based media system” means a system that includes at least one computing device, such as computing device 100, and that receives and presents media content to an occupant of a vehicle. The vehicle-based media system can receive the media content using a communication interface, such as communication interface 106, and can present the media content using a user interface, such as user interface 108.

III. Example Operations

In line with the discussion above, a vehicle-based media system can receive, via a radio broadcast, an audio advertisement, and the vehicle-based media system can present the audio advertisement to an occupant of the vehicle by outputting the audio advertisement through one or more speakers of the vehicle. To enhance the occupant's experience, the vehicle-based media system can identify the advertisement and can use the identified advertisement as a basis to identify an action. The vehicle-based media system can then cause an external computing device, such as the occupant's mobile phone, to perform the identified action, while the vehicle-based media system presents at least a portion of the advertisement. In one example use case, the vehicle-based media system can receive and present a terrestrial radio advertisement for a restaurant and while doing so, the vehicle-based media system can cause the occupant's mobile phone to display an image of a food item available at the restaurant together with a scannable identifier, such as a QR code, that links to a corresponding coupon.

One way to identify the advertisement being received by the vehicle-based media system is for the vehicle-based media system to generate audio fingerprint data representing the advertisement content being received by the vehicle-based media system, and for a computing system, such as a network server, to compare that audio fingerprint data with reference audio fingerprint data representing advertisement content of a known advertisement, with a fingerprint match indicating that the vehicle-based media system is receiving the known advertisement.

To do so, the vehicle-based media system may use one or more microphones to capture audio content that is played via one or more speakers in the vehicle. For instance, the vehicle-based media system may receive various audio content from a particular source (e.g., both entertainment and advertisement content contained in signals from a terrestrial radio station) and selectively capture specific audio content (e.g., advertisement content) for further analysis. To facilitate this, the vehicle-based media system may be configured to continuously monitor audio content that is being played in the vehicle during background processing and begin capturing the audio only upon recognizing one or more predetermined audio patterns. For instance, the vehicle-based media system can be configured to begin capturing audio content responsive to detecting a period of silence or lack of music content, as this can be indicative of the end of a song and the beginning of an advertisement.
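One way the silence-based trigger described above might be approximated is sketched below. The sketch assumes audio arrives as frames of floating-point samples in the range -1.0 to 1.0 and treats a run of low-energy frames as a period of silence. The RMS threshold, the frame count, and the function names are hypothetical choices, not values taken from this disclosure.

```python
import math
from typing import Sequence


def frame_rms(frame: Sequence[float]) -> float:
    """Root-mean-square level of one frame of samples (0.0 for an empty frame)."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def should_start_capture(
    frames: Sequence[Sequence[float]],
    rms_threshold: float = 0.01,  # hypothetical level below which a frame is "quiet"
    min_quiet_frames: int = 3,    # hypothetical run length that counts as silence
) -> bool:
    """Return True once enough consecutive quiet frames have been observed."""
    quiet_run = 0
    for frame in frames:
        if frame_rms(frame) < rms_threshold:
            quiet_run += 1
            if quiet_run >= min_quiet_frames:
                return True
        else:
            quiet_run = 0  # any loud frame resets the run
    return False


loud = [0.5, -0.5, 0.5, -0.5]
quiet = [0.0, 0.001, -0.001, 0.0]
```

Here a loud frame followed by three quiet frames triggers capture, whereas quiet frames interrupted by a loud frame do not. A practical detector would likely need tuning for cabin noise picked up by the microphone.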

In another example, the vehicle-based media system can be configured to capture audio content responsive to detecting changes in the received audio content. For instance, the vehicle-based media system can be configured to determine that the received audio content no longer includes information identifying the song or radio program and/or a source of the song or radio program, such as information encoded into the radio signal according to the RBDS communication protocol. As noted above, this encoded information might not be present when the received audio content includes advertisement content, and so the lack of this encoded information can indicate that the received audio content includes advertisement content.
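The metadata-based trigger described above can be sketched as a simple check on whatever fields the system has decoded from the broadcast. The field names `title` and `artist` are hypothetical stand-ins for decoded RBDS information, and the sketch assumes that station-identifying fields alone do not indicate primary content.

```python
from typing import Mapping, Optional


def likely_advertisement(metadata: Optional[Mapping[str, str]]) -> bool:
    """Guess that content is an advertisement when no song/program info is present.

    Assumption: `metadata` holds already-decoded broadcast fields, with
    hypothetical keys "title" and "artist" identifying primary content.
    """
    if not metadata:
        return True
    title = metadata.get("title", "").strip()
    artist = metadata.get("artist", "").strip()
    return not (title or artist)
```

Under these assumptions, a broadcast carrying only a station name is treated as likely advertisement content, while one carrying a song title is not.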

In another example, the vehicle-based media system may be configured to continuously or periodically monitor the audio content that is being played in the vehicle and capture the audio content for further analysis. In some examples, the vehicle-based media system may periodically capture the audio content and perform audio analysis (e.g., fingerprinting, finding other audio content that matches the captured audio content, etc.) in real time or near real time. In other examples, however, the vehicle-based media system may periodically capture the audio content and defer audio analysis until a later time.

In still other examples, the vehicle-based media system may engage the one or more microphones to capture audio content based on the presence (or absence) of particular information in the received audio content. In some examples, the vehicle-based media system may begin capturing audio content playing back in the vehicle in response to determining that the received audio content does not contain information indicating that the audio content is musical audio content (e.g., a song title and/or artist name), and therefore is likely advertising content. Other examples are possible.

In any case, the vehicle-based media system can determine a fingerprint for the captured audio content, and may do so in a number of ways. For example, a network server or other such entity could establish or otherwise have access to reference audio fingerprint data representing a known advertisement. For instance, the server could receive various audio advertisements and, for each audio advertisement, could generate respective reference audio fingerprints representing the respective audio advertisements. Or the server could otherwise establish or be provisioned with these reference audio fingerprints. The fingerprints determined by the vehicle-based media system and/or the reference audio fingerprints can be generated according to any audio fingerprinting process now known or later developed.

Further, in real time as the vehicle-based media system receives an audio advertisement, the vehicle-based media system could generate query audio fingerprint data representing the audio content of the advertisement that it is receiving and could transmit the query audio fingerprint data to the server for analysis.

As the server receives the query audio fingerprint data from the vehicle-based media system, the server could compare the query audio fingerprint data with the reference audio fingerprint data of the known advertisements. And if the server thereby determines with sufficient confidence that the query audio fingerprint data matches particular reference audio fingerprint data, then the server could conclude that the advertisement being received by the vehicle-based media system is the known advertisement represented by the particular reference audio fingerprint data. Further, upon reaching such a conclusion, the server could transmit, to the vehicle-based media system, information associated with the advertisement (e.g., a logo, a brand name, etc., associated with the advertisement and/or the associated business).

In other examples, once the advertisement that is being received by the vehicle-based media system has been identified, the identified advertisement can be used as a basis to retrieve visual content. For instance, the server could receive or be provisioned with respective visual content associated with each respective known advertisement, and the vehicle-based media system could retrieve, from the server, the respective visual content associated with the identified advertisement. In yet other examples, the vehicle-based media system may perform some or all of these functions locally, on the system itself.

In any event, once the visual content is received and/or retrieved by the vehicle-based media system, the system can then cause an external computing device to display the retrieved visual content, and this display can take various forms. As noted above, for instance, the visual content could include a scannable identifier, such as a QR code, that when scanned causes a computing system (e.g., an occupant's mobile computing device) to access additional content associated with the advertisement. This scannable identifier can also be scannable by a point-of-sale device to apply a discount to a purchase, such that the occupant can present the scannable identifier at a business associated with the advertisement in order to obtain the discount. In other examples, the visual content can include a logo, product information, or a website of the business associated with the advertisement. Other examples are possible as well.

As a related feature, the vehicle-based media system can also determine location data associated with the vehicle (e.g., a current location of the vehicle, a destined location of the vehicle as obtained from a navigation system of the vehicle, or a location on a navigational route between the current location and the destined location). The vehicle-based media system can use both the identified advertisement and the determined location data as a basis to retrieve the visual content. For instance, the vehicle-based media system can receive and present a terrestrial radio advertisement for a restaurant chain and while doing so, the media system can display a location of one of the restaurants.

In some examples, the vehicle-based media system can also use the determined location data of the vehicle to determine whether to retrieve particular visual content and cause the external computing device to display the retrieved visual content. To facilitate this, the server can be further provisioned with respective location data associated with each respective advertisement. For instance, the location data for a particular advertisement can include an address or GPS coordinates of a business associated with the advertisement. In such a scenario, the vehicle-based media system can be configured to only retrieve visual content for advertisements that are associated with a location within a threshold distance (e.g., within a five mile radius) of the current location of the vehicle, destined location of the vehicle, or some location on a route between the current and destined locations, and send instructions for the external computing device to display this content.

For instance, the vehicle-based media system can determine that the vehicle location data identifies a location that is within a threshold proximity/distance to a location associated with a particular advertisement. In such a scenario, the vehicle-based media system can send instructions to the external computing device that cause the external computing device to retrieve and/or display particular visual content associated with the particular advertisement. Alternatively, the vehicle-based media system can determine that the vehicle location data identifies a location that is not within the threshold proximity/distance to the location associated with the particular advertisement. In such a scenario, even if a fingerprint match is detected when comparing the query audio fingerprint data with the reference audio fingerprint data of the particular advertisement, the vehicle-based media system can refrain from retrieving and/or causing the external computing device to display visual content associated with the matched reference fingerprint data based on the matched reference fingerprint data being associated with a location that is at least the threshold distance away from the location indicated by the vehicle location data.

Further, in some examples, location data for a particular advertisement can include location data for multiple locations. For instance, as noted above, the particular advertisement might be an advertisement for a chain of restaurants, and the location data for the particular advertisement can include location data for each restaurant of the chain of restaurants. As such, the vehicle-based media system can be configured to consider the nearest restaurant location when determining whether to perform the navigation-related action. For instance, if the nearest restaurant is within the threshold proximity/distance of the vehicle-based media system location, then the vehicle-based media system can perform the navigation-related action, such as by displaying a prompt to navigate the vehicle to the nearest restaurant location. Alternatively, if the nearest restaurant is not within the threshold proximity/distance of the vehicle-based media system location, then the vehicle-based media system can refrain from performing the navigation-related action.
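By way of illustration only, the nearest-location check described above could be sketched as follows. The function names, the coordinate representation, and the use of the haversine great-circle formula are assumptions made for this sketch; the disclosure does not prescribe a particular distance computation, and the five-mile default simply mirrors the example threshold mentioned earlier.

```python
import math
from typing import Iterable, Optional, Tuple

EARTH_RADIUS_MILES = 3958.8  # approximate mean Earth radius

Coordinate = Tuple[float, float]  # (latitude, longitude) in degrees


def haversine_miles(a: Coordinate, b: Coordinate) -> float:
    """Great-circle distance between two coordinates, in miles."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    dlat = lat2 - lat1
    dlon = lon2 - lon1
    h = (math.sin(dlat / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2)
    # Clamp to guard against tiny floating-point overshoot above 1.0.
    return 2 * EARTH_RADIUS_MILES * math.asin(math.sqrt(min(1.0, h)))


def nearest_location_within(
    vehicle: Coordinate,
    ad_locations: Iterable[Coordinate],
    threshold_miles: float = 5.0,
) -> Optional[Coordinate]:
    """Return the nearest advertised location if it lies within the
    threshold distance of the vehicle; otherwise return None."""
    best: Optional[Coordinate] = None
    best_distance = float("inf")
    for location in ad_locations:
        distance = haversine_miles(vehicle, location)
        if distance < best_distance:
            best, best_distance = location, distance
    if best is not None and best_distance <= threshold_miles:
        return best
    return None
```

In this sketch, a return value of None corresponds to the case in which the system refrains from performing the action, and any other return value identifies the location that the action would reference.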

In any event, the vehicle-based media system, an occupant's mobile device, or both may take actions that result in additional content related to the advertisement being displayed via one or more external computing devices (e.g., the occupant's mobile device) and thereby increase the occupant's interactions with (and interest in) the advertisement (and the underlying business/products/services advertised therein). In a further aspect, the vehicle-based media system, the external computing device, or both may also store the associated information (e.g., reference audio content, associated visual content, the occupant's interactions with the visual content, or all of the above) for future use by the media system and/or external computing device.

A. Example GUIs and Operational Environments

To further illustrate the above-described concepts and others, FIGS. 2A-2C depict example GUIs that a vehicle-based media system 200 can be configured to display in accordance with the techniques described above. The vehicle-based media system 200 can include one or more computing devices similar to or the same as the computing device 100 depicted in FIG. 1. Further, the example environments and GUIs depicted in FIGS. 2A-2C are merely for purposes of illustration. The features described herein can involve environments, operations, GUIs, and functionalities that are configured or formatted differently, include additional or fewer components and/or more or less data, include different types of components and/or data, and relate to one another in different ways.

FIG. 2A depicts an example GUI of the vehicle-based media system 200 in a first state. Specifically, in this first state, the vehicle-based media system 200 displays a radio station 202 (“95.5”) in connection with an associated radio station logo 204 (“Hot 95.5”) and information indicating a song title 206 (“Brimstone”), artist title 208 (“The Wayward Kin”), and album title 210 (“Meanwhile in Mississippi . . . ”), for musical audio content received from the displayed radio station and currently playing on one or more speakers in the vehicle. In line with the discussion above, this information can be encoded into audio content received by the vehicle-based media system, and the vehicle-based media system can extract this information from the received audio content.

In FIG. 2A, the vehicle-based media system 200 is also connected to and in communication with an external computing device 212 via connection mechanism 214. As noted above, connection mechanism 214 may take various forms, including a wired connection (e.g., a USB, Mini-USB, or Lightning connection) and/or a wireless connection (e.g., a Bluetooth or WI-FI connection). As shown in FIG. 2A, the external computing device 212 is not displaying any additional visual content associated with the audio content received by the vehicle-based media system. In some examples, the vehicle-based media system 200 might refrain from causing the external computing device 212 to display additional visual content when the received audio content includes entertainment content. Rather, as explained in further detail below, the vehicle-based media system 200 can cause the external computing device 212 to display additional visual content when the received audio content includes advertisement content.

FIG. 2B depicts the GUI of vehicle-based media system 200 depicted in FIG. 2A, but in a second state. In the second state, the audio content received by the vehicle-based media system 200 includes advertisement content instead of entertainment content. In this state, instead of displaying information indicating a song title 206, artist title 208, and album title 210 associated with musical audio content playing in the vehicle, vehicle-based media system 200 displays content associated with the advertisement playing on the radio station. Further, the vehicle-based media system 200 sends instructions to external computing device 212 to take one or more computational actions to provide additional content based on the advertisement playing on the radio station and/or a location associated with the vehicle, among other possibilities. To do so, during playout of this advertisement, the vehicle-based media system 200 captures the advertisement audio content currently playing on one or more speakers in the vehicle and provides additional content to one or more occupants of the vehicle via display on the vehicle-based media system, the external computing device, or both.

In line with the discussion above, the vehicle-based media system 200 can capture the advertisement audio content in a number of ways. For example, vehicle-based media system 200 can include one or more microphones that the vehicle-based media system 200 can use to capture audio content playing in the vehicle during an advertisement. Further, the vehicle-based media system 200 can continuously or periodically monitor audio content that is being played in the vehicle and begin capturing audio content upon recognizing one or more predetermined audio patterns that might indicate that an advertisement is playing. In this way, the audio content in the vehicle can be monitored for purposes of determining if an advertisement is playing.

In still other examples, the vehicle-based media system 200 may capture audio content in the vehicle based on the presence (or absence) of particular information in the received audio content. For example, the vehicle-based media system 200 may begin capturing audio content playing in the vehicle in response to determining that the received audio content contains information indicating that the audio content is an advertisement. For instance, the vehicle-based media system can begin capturing audio content responsive to determining that the received audio content contains embedded information that indicates the audio content is an advertisement, but does not indicate any details of the advertisement. In other examples, the vehicle-based media system 200 may begin capturing audio content playing in the vehicle in response to determining that the received audio content does not contain certain information. For example, the vehicle-based media system 200 may determine that the audio content does not contain information indicating that the audio content is musical content (e.g., song and/or artist title), and thereby determine that the audio content is more likely than not an advertisement. Other examples are possible.
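By way of illustration only, the presence-or-absence logic described above could be sketched as follows. The EmbeddedMetadata structure and its field names are hypothetical; the disclosure does not specify how embedded information is represented once extracted from the received audio content.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class EmbeddedMetadata:
    """Hypothetical view of information extracted from received audio."""
    is_advertisement: Optional[bool] = None  # explicit flag, if one is present
    song_title: Optional[str] = None
    artist: Optional[str] = None


def should_start_capture(meta: EmbeddedMetadata) -> bool:
    """Decide whether to begin capturing audio with the microphone."""
    # Presence case: an explicit flag settles the question either way.
    if meta.is_advertisement is not None:
        return meta.is_advertisement
    # Absence case: no song or artist information suggests the content
    # is more likely than not an advertisement.
    has_music_info = bool(meta.song_title or meta.artist)
    return not has_music_info
```

Treating missing music information as a capture trigger is a heuristic; a practical system might combine it with the audio-pattern monitoring described earlier.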

The vehicle-based media system 200 can also use the captured audio content to identify reference audio content that has at least a threshold extent of similarity with the captured audio content and can do so in a number of ways.

In one example, the vehicle-based media system 200 itself can generate audio fingerprint data based on the captured audio content (e.g., by processing the captured audio content locally) and then send the generated audio fingerprint data to a second computing device (e.g., a network server) with an instruction that causes the second computing device to compare the generated audio fingerprint data to a plurality of reference audio fingerprint items and determine that one or more reference audio fingerprint items have at least a threshold extent of similarity with the generated audio fingerprint data. This similarity may indicate a match between the captured audio content and known reference audio content, thereby allowing the vehicle-based media system 200 to display information about the captured audio content based on its similarity to the known reference audio content (e.g., names and/or slogans for certain businesses).

In another example, the vehicle-based media system 200 can identify the reference audio content locally by generating audio fingerprint data of the captured audio content and comparing the generated audio fingerprint data to a plurality of locally stored reference audio fingerprint items, each of which is associated with respective reference audio content. Then, based on this comparison, the vehicle-based media system 200 can also determine that a particular reference audio fingerprint item has at least a threshold extent of similarity with the generated audio fingerprint data.
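By way of illustration only, a local threshold comparison of this kind could be sketched as follows. This sketch assumes that fingerprints are sequences of 32-bit integers and that similarity is the fraction of matching bits; both are simplifying assumptions, as the disclosure does not specify a fingerprint format or similarity measure. The identifiers and the 0.9 threshold are likewise hypothetical.

```python
from typing import Dict, Optional, Sequence, Tuple


def bit_similarity(a: Sequence[int], b: Sequence[int],
                   bits_per_word: int = 32) -> float:
    """Fraction of matching bits over the overlapping words of a and b."""
    n = min(len(a), len(b))
    if n == 0:
        return 0.0
    mask = (1 << bits_per_word) - 1
    # XOR exposes differing bits; count them word by word.
    differing = sum(bin((x ^ y) & mask).count("1") for x, y in zip(a, b))
    return 1.0 - differing / (n * bits_per_word)


def best_match(
    query: Sequence[int],
    references: Dict[str, Sequence[int]],
    threshold: float = 0.9,
) -> Optional[Tuple[str, float]]:
    """Return (reference id, score) for the most similar reference item,
    or None if no item reaches the threshold extent of similarity."""
    best_id: Optional[str] = None
    best_score = 0.0
    for ref_id, ref in references.items():
        score = bit_similarity(query, ref)
        if score > best_score:
            best_id, best_score = ref_id, score
    if best_id is not None and best_score >= threshold:
        return best_id, best_score
    return None
```

A production fingerprinting scheme would typically also account for time alignment between the captured audio and the reference, which this sketch omits.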

In yet another example, the vehicle-based media system 200 can send a portion of the captured audio content to an external computing device (e.g., a network server) with an instruction that causes the external computing device to compare, at least, the portion of the captured audio content to the plurality of reference audio content items. Such a comparison can involve the external computing device generating an audio fingerprint of the portion of the captured audio content and comparing the generated audio fingerprint to respective reference audio fingerprints, each of which can correspond to a respective reference audio content item. Then, once the external computing device performs this comparison, the vehicle-based media system 200 may receive, from the external computing device, an indication of a particular reference audio content item that has at least the threshold extent of similarity with the captured audio content.

Based on the results of this comparison, the vehicle-based media system 200 can also identify one or more computational actions based at least on the identified reference audio content and send, via the network interface of the vehicle-based media system, an instruction that causes an external computing device to perform the identified one or more computational actions.

In some examples, this may include retrieving visual content associated with the reference audio content and displaying it via the vehicle-based media system, the external computing device, or both. In some examples, to identify this associated visual content, the vehicle-based media system 200 may use the reference audio content corresponding to the fingerprint data of the advertisement to retrieve visual content associated with one or more advertisements, and may do so in a variety of ways.

In some examples, the vehicle-based media system 200 may obtain this visual content from local storage and/or from other devices, among other possibilities. In a further aspect, the vehicle-based media system may identify respective visual content for each (or at least some) of the respective reference audio fingerprint items of the plurality of audio fingerprint items based on a variety of factors (e.g., identify visual content that has the greatest extent of similarity with the reference audio content corresponding to the generated audio fingerprint data or multiple visual contents that have at least a threshold extent of similarity with the same). The external computing device can be used to perform some (or all) of these steps as well. Further, the external computing device can perform some (or all) of these actions instead of the vehicle-based computing system (e.g., to conserve computational resources of the vehicle-based computing system) or in addition to the vehicle-based computing system (e.g., to potentially increase the accuracy of the fingerprinting process and/or visual content retrieval and display). Other examples are possible.

In any case, as shown in FIG. 2B, once the vehicle-based media system 200 has identified particular reference audio content and visual content associated with that particular reference audio content, the vehicle-based media system 200 can send instructions to external computing device 212 to take one or more additional computational actions. For example, the vehicle-based media system 200 can send instructions to the external computing device 212 to navigate to a web page associated with the particular advertisement content (e.g., the homepage URL for the advertised business) and/or retrieve a coupon associated with the particular advertisement content. In a further aspect, after retrieving this coupon, vehicle-based media system 200 can send further instructions to the external computing device 212 to display an indication of the particular advertisement content and/or additional advertisement content associated with the particular advertisement content (which may be retrieved by the vehicle-based media system 200, external computing device 212, or both).
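By way of illustration only, the association between identified reference audio content and one or more computational actions, together with the instruction sent to the external computing device, could be sketched as follows. The lookup table, the action names, the placeholder URL and coupon identifier, and the JSON message layout are all hypothetical; the disclosure does not define an instruction format.

```python
import json
from typing import Dict, List, Optional

# Hypothetical table associating each reference item with its actions.
ACTIONS_BY_REFERENCE: Dict[str, List[dict]] = {
    "coffee_ad": [
        {"action": "navigate_to_web_page", "url": "https://example.com/"},
        {"action": "retrieve_coupon", "coupon_id": "EXAMPLE-COUPON"},
    ],
}


def build_instruction(
    reference_id: str,
    table: Dict[str, List[dict]] = ACTIONS_BY_REFERENCE,
) -> Optional[str]:
    """Serialize the actions associated with a matched reference item,
    or return None when no action is associated with it."""
    actions = table.get(reference_id)
    if not actions:
        return None
    message = {"reference_id": reference_id, "actions": actions}
    return json.dumps(message, sort_keys=True)
```

The resulting string could then be transmitted over whichever connection mechanism links the two devices; the transport itself is outside the scope of this sketch.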

Further, the vehicle-based media system 200 and external computing device 212 can use this identified visual content in a number of ways. In one example, the vehicle-based media system 200 and external computing device 212 can display visual content related to a particular advertisement while the advertisement is playing in the vehicle. In a further aspect, this visual content may contain additional content that increases opportunities for (and the likelihood of) an occupant's interactions with the advertisement content.

For example, as shown in FIG. 2B, the vehicle-based media system 200 displays visual content 216 corresponding to a particular advertisement and also sends instructions to external computing device 212 (via connection mechanism 214) to display additional advertising content that includes a business logo 218 (here, “Core4 Coffee Roasters”) associated with the advertisement, an offer 220 (here, “10% Off Any Size Latte *Eligible at All Locations*”), and a scannable image 222 (here, a QR code). Again, this content may enhance an occupant's experience with the underlying advertisement content (e.g., by providing even more content to an occupant of the vehicle when scanned by an occupant's mobile computing device). Other examples are possible.

Further, this advertising visual content may be displayed at various times on the vehicle-based media system 200, external computing device 212, or both, in connection with the underlying advertisement content. For example, the advertising visual content may be retrieved and displayed in real time while the underlying advertisement content is playing in the vehicle. Further, the advertising visual content may be displayed only while the underlying advertisement content is playing in the vehicle, during a portion of playback for a subsequent advertisement or musical audio content, for a particular duration (e.g., 30 seconds), persistently until an occupant takes one or more predetermined actions (e.g., scans the associated QR code, or closes the advertisement by interacting with the user interface of the vehicle-based media system), and/or when the vehicle's radio is turned off and/or performing other functionality (e.g., playing a CD), among other possibilities. For example, this advertising visual content may also be displayed the next time the vehicle is started, regardless of the audio content that is playing when the vehicle is started. Other examples and uses of associated information are possible.
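By way of illustration only, a few of the display-timing options described above could be sketched as a simple policy check. The policy names and parameters are hypothetical, only three of the described options are modeled, and the 30-second default mirrors the example duration given above.

```python
from enum import Enum


class DisplayPolicy(Enum):
    WHILE_AD_PLAYING = "while_ad_playing"  # only during ad playout
    FIXED_DURATION = "fixed_duration"      # for a particular duration
    UNTIL_DISMISSED = "until_dismissed"    # until an occupant acts


def should_keep_displaying(
    policy: DisplayPolicy,
    *,
    ad_playing: bool,
    seconds_shown: float,
    dismissed: bool,
    duration_seconds: float = 30.0,
) -> bool:
    """Return True if the advertising visual content should stay on screen."""
    # An occupant action (e.g., scanning the code or closing the ad)
    # ends the display under every policy in this sketch.
    if dismissed:
        return False
    if policy is DisplayPolicy.WHILE_AD_PLAYING:
        return ad_playing
    if policy is DisplayPolicy.FIXED_DURATION:
        return seconds_shown < duration_seconds
    return True  # UNTIL_DISMISSED and not yet dismissed
```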

Turning to FIG. 2C, the GUI of the vehicle-based media system 200 is shown in a third state, where the third state is similar to the second state depicted in FIG. 2B, in that the vehicle-based media system 200 has captured audio content, used that captured audio content to identify reference audio content having at least a threshold similarity with the captured audio content, and identified associated visual content. However, in this third state, the vehicle-based media system 200 has also captured location data of the vehicle and used that location data to identify the advertising visual content displayed via the external computing device 212.

Specifically, in order to reach the third state shown in FIG. 2C, at or around the time the vehicle-based media system 200 uses the microphone to capture audio content, the vehicle-based media system 200 can also determine location data of the vehicle (e.g., using GPS). As described above, the vehicle-based media system 200 can use the determined location data to filter out visual content that is associated with locations outside a threshold proximity of the location data and instead only select visual content that is within the threshold proximity of the location data. In this manner, the selected visual content can be location-specific visual content that is associated with a place of business that is relatively near to the vehicle-based media system.

In some examples, the vehicle-based media system 200 may obtain this geographic location data from local storage and/or from other devices, among other possibilities. In a further aspect, the vehicle-based media system may identify respective geographic location data for each (or at least some) of the respective reference audio fingerprint items of the plurality of audio fingerprint items based on a variety of factors.

In any case, as shown in FIG. 2C, once the vehicle-based media system 200 has identified particular reference audio content and visual content and geographic content associated with that particular reference audio content, the vehicle-based media system 200 can send instructions to the external computing device 212 to take one or more additional computational actions.

As shown in FIG. 2C, the vehicle-based media system can send instructions to the external computing device 212 to display, in connection with the indication of the identified reference audio content, an indication of the location-specific visual content 224 (shown here as “10% Off Any Size Latte *Eligible at 300 S. Wacker Location*”). Additionally or alternatively, the location-specific visual content 224 can include offers and/or other content that pertains to the indicated, specific participating locations (e.g., displaying particular menu items found only at a particular location, directions to the particular location, an option to order in advance of arriving at the particular location, etc.). Some or all of this location-specific content can be encoded into scannable image 226.

The vehicle-based media system 200 and external computing device 212 can perform the above processes in various ways. For example, the vehicle-based media system 200 may send instructions to the external computing device 212 to navigate to a web page associated with the particular advertisement content (e.g., the homepage URL for the advertised business) and retrieve a coupon associated with the particular advertisement content. In a further aspect, after retrieving this coupon, vehicle-based media system 200 may send further instructions to the external computing device 212 to display an indication of a geographic location associated with the particular advertisement content and/or additional advertisement content associated with the particular advertisement content (which may be retrieved by the vehicle-based media system 200, external computing device 212, or both).

In any event, by displaying location-specific visual content, the occupant's experience with the advertisement (and thereby the advertiser) may be further improved.

B. Example Methods

FIG. 3 is a flow chart illustrating an example method 300.

At block 302, the method 300 can include receiving audio content. In some examples, receiving the audio content includes receiving primary audio content and secondary audio content, wherein the primary audio content comprises entertainment content, and wherein the secondary audio content comprises advertisement content interspersed among the primary content.

At block 304, the method 300 can include causing one or more speakers to output the received audio content. In some examples, causing the one or more speakers to output the received audio content includes causing the one or more speakers to output primary audio content (i.e., entertainment content) and secondary audio content (i.e., advertisement content).

At block 306, the method 300 can include using the microphone of the vehicle-based media system to capture the output audio content. In some examples, using the microphone of the vehicle-based media system to capture the output audio content includes using the microphone of the vehicle-based media system to capture the secondary audio content.

At block 308, the method 300 can include identifying reference audio content that has at least a threshold extent of similarity with the captured audio content. In some examples, identifying reference audio content that has at least the threshold extent of similarity with the captured audio content includes identifying reference audio content that has at least the threshold extent of similarity with the captured secondary audio content. In some examples, identifying the reference audio content that has at least the threshold extent of similarity with the captured audio content includes: (i) generating audio fingerprint data of the captured audio content, (ii) comparing the generated audio fingerprint data to a plurality of reference audio fingerprint items, and (iii) determining that a particular reference audio fingerprint item of the plurality of reference audio fingerprint items has at least the threshold extent of similarity with the generated audio fingerprint data.

At block 310, the method 300 can include identifying a computational action based at least on the identified reference audio content. In some examples, identifying the computational action based at least on the identified reference audio content includes identifying a computational action related to particular advertisement content of the secondary audio content. In some examples, each respective reference audio fingerprint item of the plurality of audio fingerprint items is associated with a respective computational action, and identifying the computational action based at least on the identified reference audio content includes identifying the respective computational action associated with the particular reference audio fingerprint item that has at least the threshold extent of similarity with the generated audio fingerprint data.

At block 312, the method 300 can include sending, via the network interface of the vehicle-based media system, an instruction that causes the external computing device to perform the identified computational action. In some examples, sending the instruction includes, while causing the one or more speakers to output the particular advertisement content of the secondary audio content, sending an instruction that causes the external computing device to perform the computational action related to the particular advertisement content. In some examples, sending the instruction that causes the external computing device to perform the computational action related to the particular advertisement content includes sending an instruction that causes an external computing device to display a prompt that, when selected by a user, causes the external computing device to (i) navigate to a web page associated with the particular advertisement content, (ii) retrieve a coupon associated with the particular advertisement content, (iii) display an indication of a geographic location associated with the particular advertisement content, and/or (iv) display additional advertisement content associated with the particular advertisement content.

In some examples, the method 300 can also include determining a geographic location of the vehicle-based media system. In such examples, identifying the computational action based at least on the identified reference audio content includes identifying the computational action based on both (i) the identified reference audio content and (ii) the determined geographic location of the vehicle-based media system. As described above, this can allow the vehicle-based media system to retrieve and display location-specific visual content.
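By way of illustration only, the sequence of blocks 302 through 312, including the optional geographic location input, could be sketched as follows. Each step is supplied as a callable so that the sketch remains independent of any particular hardware or network stack; the function name and every parameter name are hypothetical.

```python
from typing import Any, Callable, Optional


def run_method_300(
    receive_audio: Callable[[], Any],
    play: Callable[[Any], None],
    capture: Callable[[], Any],
    identify_reference: Callable[[Any], Optional[str]],
    identify_action: Callable[[str, Optional[Any]], Optional[Any]],
    send_instruction: Callable[[Any], None],
    locate: Optional[Callable[[], Any]] = None,
) -> bool:
    """Run one pass of the example method; return True if an
    instruction was sent to the external computing device."""
    audio = receive_audio()                      # block 302
    play(audio)                                  # block 304
    captured = capture()                         # block 306
    reference_id = identify_reference(captured)  # block 308
    if reference_id is None:
        return False  # no reference met the similarity threshold
    # Optional: determine the geographic location of the system.
    location = locate() if locate is not None else None
    action = identify_action(reference_id, location)  # block 310
    if action is None:
        return False  # e.g., no location within the threshold distance
    send_instruction(action)                     # block 312
    return True
```

As noted elsewhere in this disclosure, the acts need not be performed in the order recited; the linear ordering here is only one possible arrangement.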

IV. Example Variations

Although some of the acts and/or functions described in this disclosure have been described as being performed by a particular entity, the acts and/or functions can be performed by any entity, such as those entities described in this disclosure. Further, although the acts and/or functions have been recited in a particular order, the acts and/or functions need not be performed in the order recited. However, in some instances, it can be desired to perform the acts and/or functions in the order recited. Further, each of the acts and/or functions can be performed responsive to one or more of the other acts and/or functions. Also, not all of the acts and/or functions need to be performed to achieve one or more of the benefits provided by this disclosure, and therefore not all of the acts and/or functions are required.

Although certain variations have been discussed in connection with one or more examples of this disclosure, these variations can also be applied to all of the other examples of this disclosure as well.

Although select examples of this disclosure have been described, alterations and permutations of these examples will be apparent to those of ordinary skill in the art. Other changes, substitutions, and/or alterations are also possible without departing from the invention in its broader aspects. 

The invention claimed is:
 1. A vehicle-based media system comprising: a network interface for communicating with an external computing device that is distinct from the vehicle-based media system; a microphone; one or more processors; and a non-transitory, computer-readable medium storing instructions that, when executed by the one or more processors, cause the vehicle-based media system to perform a set of operations comprising: receiving audio content; causing one or more speakers to output the received audio content; using the microphone of the vehicle-based media system to capture the output audio content; identifying reference audio content that has at least a threshold extent of similarity with the captured audio content; identifying a computational action based at least on the identified reference audio content; and sending, via the network interface of the vehicle-based media system, an instruction that causes the external computing device to perform the identified computational action.
 2. The vehicle-based media system of claim 1, wherein receiving the audio content comprises receiving primary audio content and secondary audio content, wherein causing the one or more speakers to output the received audio content comprises causing the one or more speakers to output the primary audio content and the secondary audio content, wherein the primary audio content comprises entertainment content, and wherein the secondary audio content comprises advertisement content interspersed among the primary content.
 3. The vehicle-based media system of claim 2, wherein: using the microphone of the vehicle-based media system to capture the output audio content comprises using the microphone of the vehicle-based media system to capture the secondary audio content; and identifying reference audio content that has at least the threshold extent of similarity with the captured audio content comprises identifying reference audio content that has at least the threshold extent of similarity with the captured secondary audio content.
 4. The vehicle-based media system of claim 3, wherein identifying the computational action based at least on the identified reference audio content comprises identifying a computational action related to particular advertisement content of the secondary audio content, and wherein sending the instruction that causes the external computing device to perform the identified computational action comprises, while causing the one or more speakers to output the particular advertisement content of the secondary audio content, sending an instruction that causes the external computing device to perform the computational action related to the particular advertisement content.
 5. The vehicle-based media system of claim 4, wherein sending the instruction that causes the external computing device to perform the computational action related to the particular advertisement content comprises sending an instruction that causes the external computing device to display a prompt that, when selected by a user, causes the external computing device to (i) navigate to a web page associated with the particular advertisement content, (ii) retrieve a coupon associated with the particular advertisement content, (iii) display an indication of a geographic location associated with the particular advertisement content, or (iv) display additional advertisement content associated with the particular advertisement content.
 6. The vehicle-based media system of claim 1, wherein identifying the reference audio content that has at least the threshold extent of similarity with the captured audio content comprises: generating audio fingerprint data of the captured audio content; comparing the generated audio fingerprint data to a plurality of reference audio fingerprint items; and determining that a particular reference audio fingerprint item of the plurality of reference audio fingerprint items has at least the threshold extent of similarity with the generated audio fingerprint data.
 7. The vehicle-based media system of claim 6, wherein each respective reference audio fingerprint item of the plurality of audio fingerprint items is associated with a respective computational action, and wherein identifying the computational action based at least on the identified reference audio content comprises identifying the respective computational action associated with the particular reference audio fingerprint item that has at least the threshold extent of similarity with the generated audio fingerprint data.
 8. The vehicle-based media system of claim 1, wherein the set of operations further comprise determining a geographic location of the vehicle-based media system, and wherein identifying the computational action based at least on the identified reference audio content comprises identifying the computational action based on both (i) the identified reference audio content and (ii) the determined geographic location of the vehicle-based media system.
 9. A method to be performed by a vehicle-based media system, the method comprising: receiving audio content; causing one or more speakers to output the received audio content; using a microphone of the vehicle-based media system to capture the output audio content; identifying reference audio content that has at least a threshold extent of similarity with the captured audio content; identifying a computational action based at least on the identified reference audio content; and sending, via a network interface of the vehicle-based media system, an instruction that causes an external computing device to perform the identified computational action.
 10. The method of claim 9, wherein receiving the audio content comprises receiving primary audio content and secondary audio content, wherein causing the one or more speakers to output the received audio content comprises causing the one or more speakers to output the primary audio content and the secondary audio content, wherein the primary audio content comprises entertainment content, and wherein the secondary audio content comprises advertisement content interspersed among the primary content.
 11. The method of claim 10, wherein: using the microphone of the vehicle-based media system to capture the output audio content comprises using the microphone of the vehicle-based media system to capture the secondary audio content; and identifying reference audio content that has at least the threshold extent of similarity with the captured audio content comprises identifying reference audio content that has at least the threshold extent of similarity with the captured secondary audio content.
 12. The method of claim 11, wherein identifying the computational action based at least on the identified reference audio content comprises identifying a computational action related to particular advertisement content of the secondary audio content, and wherein sending the instruction that causes the external computing device to perform the identified computational action comprises, while causing the one or more speakers to output particular advertisement content of the secondary audio content, sending an instruction that causes the external computing device to perform the computational action related to the particular advertisement content.
 13. The method of claim 12, wherein sending the instruction that causes the external computing device to perform the computational action related to the particular advertisement content comprises sending an instruction that causes the external computing device to display a prompt that, when selected by a user, causes the external computing device to (i) navigate to a web page associated with the particular advertisement content, (ii) retrieve a coupon associated with the particular advertisement content, (iii) display an indication of a geographic location associated with the particular advertisement content, or (iv) display additional advertisement content associated with the particular advertisement content.
 14. The method of claim 9, wherein identifying the reference audio content that has at least the threshold extent of similarity with the captured audio content comprises: generating audio fingerprint data of the captured audio content; comparing the generated audio fingerprint data to a plurality of reference audio fingerprint items; and determining that a particular reference audio fingerprint item of the plurality of reference audio fingerprint items has at least the threshold extent of similarity with the generated audio fingerprint data.
 15. The method of claim 14, wherein each respective reference audio fingerprint item of the plurality of reference audio fingerprint items is associated with a respective computational action, and wherein identifying the computational action based at least on the identified reference audio content comprises identifying the respective computational action associated with the particular reference audio fingerprint item that has at least the threshold extent of similarity with the generated audio fingerprint data.
 16. The method of claim 9, further comprising determining a geographic location of the vehicle-based media system, wherein identifying the computational action based at least on the identified reference audio content comprises identifying the computational action based on both (i) the identified reference audio content and (ii) the determined geographic location of the vehicle-based media system.
 17. A non-transitory, computer-readable medium, having stored thereon program instructions that, upon execution by a vehicle-based media system, cause the vehicle-based media system to perform a set of operations comprising: receiving audio content; causing one or more speakers to output the received audio content; using a microphone of the vehicle-based media system to capture the output audio content; identifying reference audio content that has at least a threshold extent of similarity with the captured audio content; identifying a computational action based at least on the identified reference audio content; and sending, via a network interface of the vehicle-based media system, an instruction that causes an external computing device to perform the identified computational action.
 18. The non-transitory, computer-readable medium of claim 17, wherein: receiving the audio content comprises receiving primary audio content and secondary audio content, the primary audio content comprising entertainment content, and the secondary audio content comprising advertisement content interspersed among the primary audio content; causing the one or more speakers to output the received audio content comprises causing the one or more speakers to output the primary audio content and the secondary audio content; using the microphone of the vehicle-based media system to capture the output audio content comprises using the microphone of the vehicle-based media system to capture the secondary audio content; identifying reference audio content that has at least the threshold extent of similarity with the captured audio content comprises identifying reference audio content that has at least the threshold extent of similarity with the captured secondary audio content; identifying the computational action based at least on the identified reference audio content comprises identifying a computational action related to particular advertisement content of the secondary audio content; and sending the instruction that causes the external computing device to perform the identified computational action comprises, while causing the one or more speakers to output particular advertisement content of the secondary audio content, sending an instruction that causes the external computing device to perform the computational action related to the particular advertisement content.
 19. The non-transitory, computer-readable medium of claim 18, wherein sending the instruction that causes the external computing device to perform the computational action related to the particular advertisement content comprises sending an instruction that causes the external computing device to display a prompt that, when selected by a user, causes the external computing device to (i) navigate to a web page associated with the particular advertisement content, (ii) retrieve a coupon associated with the particular advertisement content, (iii) display an indication of a geographic location associated with the particular advertisement content, or (iv) display additional advertisement content associated with the particular advertisement content.
 20. The non-transitory, computer-readable medium of claim 17, wherein the set of operations further comprises determining a geographic location of the vehicle-based media system, and wherein identifying the computational action based at least on the identified reference audio content comprises identifying the computational action based on both (i) the identified reference audio content and (ii) the determined geographic location of the vehicle-based media system.
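
By way of illustration only, the fingerprint comparison and action lookup recited in claims 14 and 15 can be sketched as follows. This is a minimal, non-limiting sketch under stated assumptions, not the claimed implementation: the fingerprint format (a sequence of 32-bit words), the bit-agreement similarity measure, the 0.9 threshold, and the names ReferenceItem, similarity, and identify_action are all hypothetical and do not appear in the disclosure.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class ReferenceItem:
    """Hypothetical reference audio fingerprint item and its associated action."""
    name: str
    fingerprint: Sequence[int]  # assumed format: a sequence of 32-bit words
    action: str                 # identifier of the associated computational action


def similarity(a: Sequence[int], b: Sequence[int]) -> float:
    """Return the fraction of matching bits between two equal-length fingerprints."""
    if len(a) != len(b) or not a:
        return 0.0
    differing = sum(bin((x ^ y) & 0xFFFFFFFF).count("1") for x, y in zip(a, b))
    return 1.0 - differing / (32 * len(a))


def identify_action(
    captured: Sequence[int],
    references: Sequence[ReferenceItem],
    threshold: float = 0.9,  # assumed "threshold extent of similarity"
) -> Optional[str]:
    """Return the action of the best reference item meeting the threshold, if any."""
    best: Optional[ReferenceItem] = None
    best_score = threshold
    for item in references:
        score = similarity(captured, item.fingerprint)
        if score >= best_score:
            best, best_score = item, score
    return best.action if best is not None else None


# Hypothetical reference items, each associated with a computational action.
REFERENCES = [
    ReferenceItem("ad_a", [0xFFFFFFFF, 0x00000000], "show_coupon"),
    ReferenceItem("ad_b", [0x00000000, 0xFFFFFFFF], "open_web_page"),
]

# A captured fingerprint that differs from "ad_a" in two of 64 bits.
matched_action = identify_action([0xFFFFFFFE, 0x00000001], REFERENCES)

# A captured fingerprint that resembles neither reference item.
unmatched_action = identify_action([0x0F0F0F0F, 0x0F0F0F0F], REFERENCES)
```

In this sketch, a captured fingerprint that meets the threshold for a reference item yields that item's associated action, and a fingerprint that meets the threshold for no item yields no action. A geographic location, as in claims 16 and 20, could serve as a further lookup key alongside the matched item.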